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ABSTRACT 

We develop an abstract framework and convergence theory for Galerkin approximation 
for inverse problems involving the identification of nonautonomous nonlinear distributed 
parameter systems. We provide a set of relatively easily verified conditions which are suffi- 
cient to guarantee the existence of optimal solutions and their approximation by a sequence 
of solutions to a sequence of approximating finite dimensional identification problems. Our 
approach is based upon the theory of monotone operators in Banach spaces and is appli- 
cable to a reasonably broad class of nonlinear distributed systems. Operator theoretic and 
variational techniques are used to establish a fundamental convergence result. An example 
involving evolution systems with dynamics described by nonstationary quasi-linear elliptic 
operators along with some applications are presented and discussed. 
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1. Introduction 


In this paper we develop a general abstract approximation framework and convergence theory 
for Galerkin approximations for inverse problems involving nonautonomous nonlinear distributed 
parameter systems. We consider parameter estimation problems formulated as the minimization 
over a compact admissible parameter set, of a least-squares-like performance index subject to state 
constraints given by an inhomogeneous nonlinear distributed system. Our theory applies to 
systems whose dynamics can be described by nonstationary strongly maximal monotone 
operators defined on a reflexive Banach space which is densely and continuously embedded in a 
Hilbert space. This class of operators represents a nonlinear analog of the class of regularly 
dissipative, or abstract parabolic, linear operators. Our treatment relies heavily on the general 
theory for nonlinear evolution equations in Banach space given by Barbu in [9]. More specifically, 
we make extensive use of his Theorem III. 4.2 which serves as the basis for an existence, 
uniqueness, and regularity result that we require and for the fundamental convergence result we 
prove in section 3 below. 

We employ standard Galerkin techniques to obtain a sequence of approximating identification 
problems, each of which involves finite dimensional state constraints. We demonstrate that if 
readily verifiable conditions on the system's dependence on the unknown parameters are satisfied, 
and the usual assumptions necessary for the convergence of Galerkin approximations hold, then 
solutions to the finite dimensionally constrained problems exist and, in some sense, approximate a 
solution to the original infinite dimensional identification problem. 

Our treatment here and the approximation theory we develop generalizes and extends earlier 
results of ours and others in some significant ways. First, the variational approach that we use to 
establish our fundamental convergence result generalizes the techniques that have been widely used 
for this purpose elsewhere in the parameter identification literature (see, for example, [3], [5], [8]). 
However, in each of these instances, the variational arguments were given strictly in the context of 
particular inverse problems involving the identification of rather specialized linear distributed 
parameter systems. Second, the results given here extend the abstract approximation theories for 
autonomous linear system identification, and for quasi-autonomous nonlinear system identification 
developed in [4] and [7], respectively, to inverse problems for systems whose dynamics are 
described by temporarily inhomogeneous nonlinear operators. We note that although we have 
extended the theories developed in [4] and [7], the approach we have taken here in achieving this 
end is significantly different. Indeed, in [4] and [7] convergence is argued via an application of an 
abstract approximation result for evolution systems in Banach space such as the Trotter Kato 
Theorem (see [15]) or its nonlinear analogs (see, for example, [10] and [1 1] ). While the 
approximation results given in [10] and [11] could be applied in the nonautonomous case, this 
would require that the resolvents of the time dependent system operators satisfy a Lipschitz-like 
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condition with respect to the time variable (see [ 7 ]). However, a condition on the resolvent is, in 
general, not easily verified for the infinite dimensional system and can be especially difficult to 
establish for the sequence of approximating finite dimensional systems. On the other hand, with 
the variational approach that we have taken here, a mildly restrictive, relatively easily verified, 
measurability condition on the temporarily varying system operator is the only assumption on the 
time dependence of the system that we require to establish existence of solutions and convergence. 

We provide a brief outline of the remainder of the paper. In section 2 we define the class of 
nonlinear evolution systems and identification problems on which we shall be focusing our 
attention and we establish an existence, uniqueness, and regularity result for solutions to 
nonautonomous nonlinear distributed systems. In section 3 we develop our approximation theory 
and prove the fundamental convergence result. In section 4 we present an example and discuss 
some applications. 

2 . Inverse Problems for a Class of Nonautonomous Nonlinear Distributed Systems 

Let be a metric space with Q, known as the admissible parameter set, a compact subset of 
Let Z, which we shall refer to as the observation space, be a normed linear space with norm 
M z . Let H be a Hilbert space with inner product <■,•> and corresponding induced norm M, and 
let V be a reflexive Banach space with norm INI. We assume that V is densely and continuously 
embedded in H. The latter assumption implies that there exists a constant |1 > 0 for which 
Itpl < pJItpll, for all cp eV. Let V be the space of continuous linear functionals defined on V and 
denote the usual dual space norm on V by INI*. Identifying H with its dual, we have V c H = 
H c V with H densely and continuously embedded in V . It follows that llcpll* < |i|(p|, for all 
9 e H and that II9II* < |I 2 ||9||, for all 9 e V. For 96 V* and yeVwe shall denote the duality 
pairing between 9 and \j / by <9, \j/>. Of course when 9 e H, the pairing <9, \j r> agrees with the 
usual H inner product of 9 and \| r. 

Let T > 0 be fixed, and for each q e Q and almost every t e [ 0 ,T] let A(t;q) be a 
hemicontinuous (i.e. w-lim o A(t;q)(9 + h\)/) = A(t;q)9 for 9, \| / e V) in general nonlinear, 

% 

operator defined on all of V with range in V . In our discussions below, we shall require that the 
family of operators A(t;q), t £ [ 0 ,T], q £ Q, satisfy the following conditions. 

• sj( 

(A) (Continuity) For each 9 £ V the map q -» A(t;q)9 is continuous from Q c % into V for 
almost every t e [ 0 , T]. 

(B) (Equi-Y-Monotonicity) There exist a constant co and a positive constant a which do not 
depend upon q £ Q or t £ [ 0 ,T] such that 

<A(t;q)9 - A(t;q)\p, 9 - \|/> + C0I9 - \|/| 2 > ot|]cp - \j/]|P 
for all 9,\j/ £ V, almost every t £ [ 0 ,T] and some p with 2 < p < °o . 
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(C) (Equi-Boundedness) There exists a constant P > 0 which does not depend upon q £ Q or 
t e [0, T] such that 

IIA(t;q)cpll* < pdltpIP' 1 + 1) 
for all tp e V and almost every t e [0, T] . 

(D) (Measurability) For each q £ Q the function A(t;q)u(t) : [0, T] V is strongly measurable 
for every u £ L p (0,T;V). 

•k 

For each q £ Q let u 0 (q) be an element in H and let f(- ;q) be a function in L p . (0,T; V ) where 
p and p' are conjugate exponents (i.e. 1/p + 1/p* = 1). We assume that the mappings 
q —> u 0 (q) and q -» f(- ;q) are continuous from Qc^ into H and L p .(0,T; V ) respectively. 

For each z £ Z, let <$(- ;z) be a mapping defined on L p (0,T;V) with range in the nonnegative real 
numbers and which is continuous when restricted to one or the other of the two spaces C(0,T;H) or 
L p (0,T;V) endowed with their respective usual topologies. We consider the following abstract 
parameter identification problem. 

(ID) Given observations z £ Z, find parameters q £ Q which minimize the performance index 

J(q) = <b(u(- ;q);z) 

where u(- ; q) is the solution to the initial value problem 

(2. 1) u(t) + A(t;q)u(t) = f(t;q), a.e. t e [0, T] 

(2.2) u(0) = u 0 (q) 
corresponding to q £ Q. 

The existence of a solution q £ Q to problem (ID) will follow as a consequence of the 
approximation theory and results to be presented in the next section. However, the notion of a 
solution to the abstract initial value problem (2.1), (2.2) for each q e Q must be made precise, and 
its existence, uniqueness, and regularity properties must be demonstrated. We have the following 
theorem. 

Theorem 2.1 If condition (B) - (D) are satisfied, then for each q £ Q there exists a unique 
> * 

function u(- ; q) which is V -absolutely continuous on [0,T], and satisfies u(- ;q) E L p (0,T;V) 
f) C(0,T;H), ii(- ;q) £ L p -(0,T;V ), the abstract differential equation (2.1) for almost every 
t £ [0, T], and the initial condition (2.2). 
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Proof For each q e Q and almost every t e [0,T] define A (0 (t; q) and f w (t; q) by A w (t; q) = 
e'® 1 A(t;q) e® 1 + col and q) = e'® l f(t;q). Then for almost every t e [0,T], A to (t;q) is a 

5 |$ 

nonlinear hemicontinuous operator from V into V and f tt (t;q) e L p .(0,T;V ). Also, condition (D) 

* ^ 

implies that for every u £ L p (0,T;V), the function of t, A (0 (t;q)u(t) : [0, T] — » V is strongly 
measurable. Conditions (B) and (C), and straight forward calculations yield the existence of an a M 
> 0 and a > 0 (both depending upon p)‘for which 

< A m (t;q)<p - (p - \|/> > a J(p - \|/||P , 

(i.e., that the operator A £0 (t;q) is strongly monotone) and 

II A C0 (t;q)cplU < p^lltpIlP' 1 + 1) 

hold for all cp,\]/ eV and almost every t e [0,T]. Finally, for all (p E V we have 

< A tt (t;q)cp, (p> = < A w (t;q)cp - A ffi (t;q)0, cp - 0> + < A w (t;q)0, cp> 

^ aJ|(p||P - I < A m (t;q)0, cp>l 
^ aJ|(p||P - II A m (t;q)0ll*ll(p|| 

> cxJ|cp||P - Pjltpll 

2 aJMI P - -%• - ~ »<P" P 
P'e P 

= Yj<Pll p + 

where e > 0 is chosen small enough so that Y u - a co “ eP/p > 0, 5 W = - P p /p’e p and 0 denotes the 
zero vector in V. Then an application of Theorem m. 4.2 in Barbu [9] yields the existence of a 
unique V -valued absolutely continous function u^O ;q) defined on [0, T] which satisfies 
uj-;q) £ Lp(0,T;Y) f) C(0,T;H), ii w (- ;q) e L p ,(0,T;V*), 

«co(pq) + = yqq)’ 

for almost every t £ [0, T] and u w (0;q) = u 0 (q). The conclusion of the theorem now follows 
immediately by setting u(t;q) = e® 1 u^f.q) for t E [0,T]. 

3. An Abstract Approximation Framework and Convergence Theory 

For each n = 1,2,... let H n be a finite dimensional subspace of H which is also contained in V. 
Let P n : H — > H n denote the orthogonal projection of H onto H n . We shall require that the 
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following convergence condition is satisfied by the subspaces H n and the corresponding 
projections P n . 

(E) For each cp £ V, we have li m o IIP n cp - cpll = 0 . 

Condition (E) and the Principle of Uniform Boundedness imply that there exists a constant v > 0, 
independent of (p e V and n, for which IIP n cp - cp|| < v||(p||. Note also that V densely and 

continuously embedded in H, IP n l = 1, and condition (E) imply jjmJP n cp - (pi = 0 for all cp E H. 

For each n = 1,2,..., each q e Q and almost every t e [0, T], we define the operator A n (t; q) : 
H n H n to be the restriction of the operator A(t;q) to H n with the image in V of cp n e 
A(t;q)(p n , considered to be a linear functional on Hj,. Identifying H n with its dual, for cp n e H n 
we obtain A n (t;q)cp n = \j/ n where \|/ n is that element in H n which satisfies < A(t;q)cp n , % n > = 

<Vm Xn> forall %n e H n • 

For each n = 1,2,..., and each q e Q we define u 0n (q) e H n by u 0n (q) = P n u Q (q) and for 
almost every t e [0, T] we define f n (t;q) to be the restriction of f(t;q) E V to f^. Note that the 
Riesz representation theorem implies that f n (- ;q) E L p .(0,T;H n ). We consider the following 
sequence of parameter identification problems. 

(ID n ) Given observations zeZ, find parameters q n eQ which minimize the performance index 

(3.1) J n (q) = 0(u n (-;q);z) 

where u n (- ;q) is the solution to the initial value problem in H n 

(3.2) u n (t) + A n (t;q)u n (t) = f n (t;q), a.e. t E [0,T] 

(3.3) ti n (°) = u 0n (q) 
corresponding to qEQ. 

The initial value problem (3.2), (3.3) in H n is the standard Galerkin approximation to the initial 
value problem (2. 1), (2.2). With appropriate minor modifications. Theorem 2. 1 yields for each 
n = 1,2,... and each q e Q the existence of a unique absolutely continuous function u n (- ;q): 

[0,T] — > H n which satisfies the finite dimensional ordinary differential equation (3.2) for almost 
every t £ [0,T], and the initial conditions (3.3), with u n (- ;q) £ L p ,(0,T;H n ). 

We would like to demonstrate that if the conditions (A) - (E) above are satisfied, then a 
solution to the inverse problem (ID n ) exists for each n = 1,2,..., and that these solutions imply the 
existence of, and in some sense approximate, a solution to problem (ID). In Theorem 3.1 below 
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we show that the mapping q — » u n (- ;q) is continuous from Qc ^ into both C(0,T;H) and 
Lp(0,T;V) and that li m u n (- ;q n ) = u(- jq^) in both C(0,T;H) and Lp(0,T;V) whenever 

{qjj} is a sequence in Q with lim q n = q Q . Theorem 3. 1 is in fact sufficient to conclude 

that the desired existence and approximation results obtain. Indeed, the continuous dependence of 
u n (- ; q) on q implies that for each n = 1,2,..., J n given by (3.1) is continuous on Q. Since Q 
was assumed to be a compact subset of the metric space the existence of a solution q n to 

problem (ID n ) follows immediately. Now {q n } cQ and Q compact imply the existence of a 

convergent subsequence {q n } of the sequence {cL}. If lim q = q , then qeQ and 

j j 

J(q) = 0>(u(- ;q );z) = d>( lim u n .(- ;q n .);z) 

j j 

= lim 0(u n (• ;q n .);z) = lim J n .(q n .) 

J-»°° j j j J 

< lim J n (q) = lim <&(u n (- ;q);z) 

j-»“ j J->°° j 

= 0( lim u n (• ;q);z) = 0(u(- ;q);z) 

j_»oo j 

= J(q) 

for every qeQ. Consequently q is a solution to problem (ID). We note that the limit of any 
convergent subsequence of { q n } is a solution to problem (ID). When problem (ID) admits a 
unique solution q, the sequence { q n } itself is convergent and its limit is q . 

Theorem 3.1 If conditions (A) - (E) are satisfied, then 

(i) . tirn^ u n (- ;qj = u(- ;q Q ) in C(0,T;H) and L p (0,T;V) whenever {q n } is a 

sequence in Q with lirn^ q„ = q 0 , and 

(ii) for each fixed n = 1,2,..., lim^UnO jq^ = u n (- jq^ in C(0,T;H) and L p (0,T;V) 
whenever { q m } is a sequence in Q with tim q^ = q^ . 

Proof We shall prove (i) only; the proof of (ii) is analogous. We establish (i) with an argument 
in the spirit of those in [3], [8] set in an abstract framework similar to the one used by Barbu to 
prove his Theorem III. 4.2. in [9]. Fix t £ [0,T] and define the Hilbert space % by % = L 2 (0,t;H) 
together with the inner product 

t 

(x,y) = J <x(s),y(s)>ds 
o 

and corresponding induced norm Ixl^ = 7 ( x » x ) • Define the reflexive Banach space 
Tf by V = L p (0,t;V) with norm 
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Ilxll v = ( J llx(s)!l P ds) 1/P 
o 

Then V = L p .(0,t;V ), the dual norm is given by 

t 

llxll v . = ( Jllx(s)l£ds) 1/P ' 

0 

and the dense and continuous embedding *1/ c W c V holds. 

For each q e Q, define the operator ift(q) : Dom (ifc(q)) cV -)V by <B(q)v = v, for 
v e Dom (B(q)) ={ueV : ii e V*, u(0) = u Q (q)} where the derivative in the above 
definition is in a generalized or distributional sense (see Lions, [13]). It is shown in [9] that the 
operator H3(q) is maximal monotone on V x V and that Dom (<fr(q)) c C(0,t;H). For each 
q e Q define the operator J3(q) : Tf — > Tf by 

( J?(q)x)(s) = A(s;q)x(s), xeV, a.e. se[0,t]. 

Using the properties of the operators A(s;q), it is not difficult to argue that A(q) is hemicontinuous 
and satisfies 

(3.4) (-fl(q)x - M q)y, x - y) + colx - yl^ > a llx - yll ^ 

for all x,y e Tf . In light of our proof of Theorem 2.1, it is clear that we may, without loss of 
generality take (0 in condition (B) and (3.4) above equal to zero. We shall do this in our 
discussions below. It follows (see [9]) that the operator <T(q) : Dom (<J3(q)) c Tf — > Tf given 
by (T (q) = <B(q) + j^(q) is maximal monotone on Tf xV* and that 7i(cT (q)) = Tf * . 
Consequently, the operator <T (q)'* : Tf* — > Dom (fli(q)) is well defined. Henceforth denoting 
u(- ;q 0 ) and f(- ;q 0 ) by u and f respectively, we have f £ Tf , u e Dom ((fr(q 0 )) and u = 

5r(qo)-‘f. 

For each n = 1,2,... let V n denote the linear space H n endowed with the V-topology. That 
is, V n is H n considered as a subspace of V rather than H. Let % n = L 2 (0,t;H n ) and Tf n = 

<{t 

Lp(0,t;Y n ). Then, since H n is finite dimensional, V n is H n endowed with the V -topology and 
Tf * = L p -(0,t;V* ). Define the operators fa n (q): Dom (B n (q)) cV n ->vJ, j^(q): Tf n -> Tf * 
and T n (q) : Dom (35 n (q)) c V n V* by 

B n (q)x n = x n , x„E Dom (<J3 n (q)) = {v n e V n : v n e Tf* , v n (0) = P n u Q (q)}, 
(^n(q)x n )( s ) = A n (s;q)x n (s), x„ e Tf n , a.e. s e [0,t], 
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and 


*r n (q) = ® n (q) + ^nCq)* 

respectively. As was the case above we have that B n (q) is maximal monotone, ^(q) (actually, 
^(q) + col) is strongly monotone on Tf n xV n and <fi,(<T n (q)) = < U“ n . Denoting 
u n C ;q n ) by u n and f n (- ;q n ) by f n we have that f n e V* and that u n = ?T n (q n ) -1 f n with 
u n 8 Dorn (<R n (q n )). 

Now for x e V and q 8 Q, condition (C) implies 

(3.5) ILq(q)xll v . = (JllA(s;q)x(s)l£ds) 1/P 

o 

f „p' p-i p' l/p' 

< (I p (llx(s)ll + 1) ds) 

0 

t t 

„ r (p-i)p’ i/p' r t/p’ 

<p{(| llx(s)!l ds) + (] ds) } 

0 0 
p-1 1/p' 

<p{llxll P v + T P } 

< Ptllxll^ 1 + 1}. 

This estimate, which also holds for \(q), in turn implies that there exists a constant M > 0, 
independent of t e [0, T] and n = 1,2,... for which llu n ll v < M, n = 1,2,.... Indeed, using 0 to 
denote the zero vector in V and noting that (^n(q)w n ,v n ) = (J?(q)w n ,v n ) for all w n ,v n in Tf n , (3.5) 
can be used to argue 

allUnll^. ^ (^(q n ) u n-^(qn) 6 > u n) 

= ($r n (q n )u n ,u n ) - (B n (q n )u n ,u n ) - (J3(q n )0,u n ) 

t 

= (f».u„) - \ J £ lu n (s)l 2 ds ■ Wq n )8,u n ) 

0 

£llf n ll v .llu n ll v + i| p „u 0 (q n )l 2 - j lu„(t;q„)l 2 + 11^)611 Ilujl^ 

S»f,- ffl v .«u„ll v + Ilfl^.lluJV+iVqJ-u^q/ +i lu 0 (q/ + f llu„ll„ . 


The continuous dependence assumptions on f(- ; q) and u Q (q), lim^ q n = q Q , and the 
application of the familiar inequality 

/ 

p p . p 

ab < + -A— , a, b > 0, e > 0 

P e p p' 

with e chosen sufficiently small, yields the desired uniform bound. 
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For x£%, define x n e% n by x n (s) = P n x(s), a.e. s e [0, t]. Then for x e Dom(B(q)), 
using the definition of generalized derivative, it is not difficult to argue that x n e Dom(B n (q)) and 
that (B n (q)x n - <J3(q)x, v n ) = 0 for all v n e TT n . 

It then follows that 

allu n - ull P v < (%)u n - .2(q n )u, u n - u) 

= (^(q n )u n - J3(q 0 )u, u n - u n ) + (^(q n )u n - ^(q 0 )u, u n - u) 

+ (J3(q 0 )u - J3(q n )u, u n - u) 

= (tT n (q n )u n - S' (q 0 )u, u n - u») - (B n (q n )u n - fa(q 0 )u, u n - u n ) 

+ (^(q n )u n - ^(q 0 )u, u n - u) + (^(q 0 )u - -3(q n )u, u n - u) 

= ( f n - f > U n ' U ") ’ (^n^iX ' ^n^o) Un > U n ' u ") 

+ (*(q n )u n - -^(q 0 )u, u n - u) + (^(q 0 )u - ^(q n )u, u n - u) 

t 

£llf n -fll v .llu n -u“ll v - I J £ Iu„(s) - u"(s)l 2 ds 

0 

+ ll^q n )u n - ^(q 0 )ull v , llu n - u!l v + II^Cq Q )u - ^qjull^. Ilu n - u« v . 

Condition (E), the uniform bound on llu n ll v , (3.5), and the final estimate above imply 
OIK - < < {M + (v + 1) Mull v ) llf n -fll v . 

+ j |u 0 (q n ) - u 0 (q 0 )|2 ‘ Y lu n(t;qn)-P n u(t;q 0 )i 2 

+ p(M P 1 + llull^ 1 + 2)llu n - ull v + — i— il^(q 0 )u - ,3(q n )ull p + llu n - ull^ . 

p'e p v p 

With e > 0 chosen sufficiently small, we obtain 

(3.6) j- lu n (t ; q„) - P n u(t;q Q )l 2 + (a - -|- ) lli^ - ull^ 

< { M + (v + l)lhill v ) llf n - fll v . + \ Iu 0 (q n ) - u 0 (q 0 )| 2 

+ P (M P 1 + IluC 1 + 2)llu n - ull + ~pT IL *<J<)) U - -^qn) ul C* • 

V V pi V 
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Conditions (A), (C), and (E), and the continuous dependence assumption on u 0 (q) and f(- ;q) 
imply that the right hand side of the estimate (3.6) tends to zero as n — > °°. Consequently the left 
hand side tends to zero as well. Moreover, by replacing any t dependence on the right hand side 
with T we find lirn^ u n (- jqJ = u(- ;q Q ) in L p (0,T;V) and lirn^ {u n (* jqJ - u n (- ;q Q )} = 0 

in C(0,T;H). However 

lu n(t;q n ) - u(t;q Q )l < lu n (t;q n ) - u"^) I + IP n u(t;q Q ) - u(t;q Q )l 

for t e [0, T]. Therefore, condition (E) and u(- ; q Q ) e C(0,T;H) imply that lim^u n (- ;q n ) = 
u(- ;q 0 ) in C(0,T;H) and the theorem is proved. 

Remark In practice, it is frequently the case that the parameter space ^ and the admissible 
parameter set Q are infinite dimensional with elements consisting of spatially and/or temporally 
varying functions. If this is the case, to actually solve the approximating identification problem 
(ID n ), in addition to making the state discretization, the admissible parameter set Q must be 
discretized as well. Briefly, this can be carried out as follows. For each m = 1,2,... let 
I m :Qc<3>— »^bea continuous map with finite dimensional range and the property that 
lim I m (q) = q, uniformly in q for q e Q. We set Q m = I m (Q) (note that for each m, Q m is a 

compact subset of <\) and consider the doubly indexed sequence of approximating identification 
problems (ID™) where (ID™) is the problem (ID n ) with Q replaced by Q m . Each of the problems 

m. 

admits a solution e Q and it can be argued that there exists a subsequence {q n J } c {q™} 

k 

m. 

for which lim q n J = q , with q a solution to problem (ID). (A more detailed discussion of 
j.k— »<» k 

the double discretization procedure outlined above can be found in [5].) Once bases for H n and the 
range of I m have been chosen, the problem (ID™) becomes one involving the minimization of a 
continuous functional over a closed and bounded subset of Euclidean space subject to finite 
dimensional (ODE) constraints. The resulting optimization problem is typically solved using an 
iterative search procedure, a variety of which have been implemented as a part of any one of a 
number of standard readily available software packages (for example, IMSL, MINPACK, etc.) 

(see [3]). 

4. An Example and Applications 

We present an example of a class of temporally inhomogeneous nonlinear operators and 
corresponding nonautonomous nonlinear evolution systems to which the general theory developed 
above applies. We also briefly outline a sampling of application areas which give rise to inverse 
problems involving both linear and nonlinear nonautonomous distributed systems of the type 
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discussed, in our example. We treat here the relevant theory only; implementation questions will be 
discussed and numerical findings are and will be reported on elsewhere (see e.g. [1], [8]). 

Let T > 0 be fixed, let Q be a bounded region in with smooth boundary 3 £2, and for 
a = (a 1 ,ct 2 ,...,oc^) a multi-index of nonnegative integers let |ot| = a 1 + o^+'-'+oc^ . We 
denote the a^ 1 order generalized, or distributional, derivative of a measurable function u defined 
on £2 by D a u; that is 


a , 


D a u(x) = -2 


1 


a i 

Sx^ 


— oT" u (x), 
3x * 


a.e. x e £2 


where a = (a^ a 2 ,..., a^). Let m be a fixed nonnegative integer, set N = i J , and 

j=o 

for u measurable on £2, let 8u denote the N-vector valued function whose components are D 0 ai for 
all multi-indices a with 0 < |a| < m. Define the metric space by 


N. 


= X L m ((0,T) X £2 x R 1 ’) 

0<la|,|P|<m 


and let Q be a compact subset of ^ with the properties given in (1) and (2) below. 

The vector valued function q = {q^ p} , 0 < loci, ipi < m, is an element in Q if 

(1) The real valued mapping £ — » q^p(t,x,Q defined on R N is continuous for almost every 
(t,x) e [0, T] x £2 and all multi-indices a and p with 0 < loci, Ipl < m. 

(2) There exists a positive constant A. which does not depend on q e Q for which 

(^pCt.X.O Cp - q^dATDTlp)^ - Tlcc) — ^ ^a — T lal 

0<lal,ipi<m 0<|a|<m 

for almost every (t, x) £ [0,T] x £2 and all q e R N . 

We take H = L2(£2) and let V be any closed linear subspace of H m (£2) which contains 
H^(£2). Then V* c H' m (£2), and for each q = {q a) p} E Q and almost every t e [0,T] define the 
operator A(t;q) : V — » V* by 

(4.1) <A(t;q)u,v> = ^ J q a p(t,x, 5u(x))D P u(x)D a v(x)dx 

0<|a|,|P|<m d 
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for u,v e V. The notation <•,•> appearing in the definition (4.1) above will be used to denote 
both the usual inner product on H = 1^(0.) and the duality pairing between V and V . The 
operator A(t;q): V — > V is the distributional form of the formal differential operator 

A(t;q)u(x) = V (-l) 1 ” 1 D a q a „(t,x, 8u(x))D |3 u(x) 

0<lal,IP|<m 

for almost every (t,x) e [0,T] x Q, and is said to be of quasi-linear elliptic type. It is not difficult 
to show that (1), (2) above imply that for each q e Q the operator A(t;q) given by (4.1) is 
hemicontinuous from V into V and that conditions (A) - (D) are satisfied. We note that since V is 
separable, weak and strong measurability are equivalent (see [12]). Consequently condition (D) 
can be verified by showing that the real valued mapping t -» <A(t;q)u,v>, u, v e V, is measurable 
in the usual sense on [0, TJ. When V is chosen as either Hg’(D) or H m (£>), the abstract evolution 
equation (2. 1) with A(t;q) V — > V given by (4.1) corresponds to the quasi-linear parabolic partial 
differential equation 

|j-(t,x)+ ^ (-1) N D a q a> p(t,x,5u(t,x))D P u(t,x) = f(t,x;q), a.e. x e Q, t > 0 

0<|at.|P|<m 

together with the standard homogeneous Dirichlet or Neumann boundary conditions, respectively. 
We illustrate the applicability of the general theory which we have developed above with a brief 
description (along with appropriate references to more detailed treatments) of some inverse 
problems of the form of problem (ED) which have arisen in practice and have been documented 
elsewhere in the literature. 


4.1 Size Structured Population Dynamics 

We consider an application of the Fokker Planck theory to model size structured population 
dynamics (see [20]). With the spatial domain representing the size distribution of the species in 
question, and u(t,x) denoting its population density at time t and size x, the assumption that growth 
or aging is a Markov transition process and an argument based upon a Brownian motion paradigm 
(see [14]) leads to the equation 

-^7 (M (t,x)u(t,x))=0 
dx ] J 

where for j = 1,2,... 
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Mj(t,x) = Um Q J (£- x) J (p(t,x; t + At, 5)d^. 


The kernel cp(t,x, t + At, Q is the probability density function for the transition from size x at time 
t to size £ at time t + At, and the functions Mj can be interpreted as the moments of the time rate of 
increase in size or as the time rate of change of the moments of the growth process. If we make the 
usual assumption that ]Vl(t,x) ~ 0 , j > 3, and set q^t.x) = M^qx) and q 2 (t,x) = -i- M 2 (t,x), 

we obtain the Fokker Planck equation 


(4.2) |^(t,x) --^y(q 2 (t,x)u(t,x)) + -^-(q 1 (t,x)u(t,x)) + q 0 (t,x)u(t,x) = 0, x Q < x < x p t>0 

where we have included a mortality term, qgu, and taken xq and x ^ to be respectively the minimum 
and maximum sizes of individuals in the population. 

Along with the linear partial differential equation (4.2), appropriate boundary conditions have 
to be specified. We shall impose a renewal type boundary condition at the minimum size, 


i 

(4.3) (q 1 (t,x)u(t,x) - -^-(q 2 (t,x)u(t,x))} x=Xo = J q 3 (t,x)u(t,x)dx, t>0 , 


x o 

and a zero flux condition (i.e. individuals can not grow beyond the maximum size) 
(4.4) (q 1 (t,x)u(t,x) - ^(q 2 (t,x)u(t,x))} x=Xi =0, t>0 


at the maximum size. (A more detailed discussion of the modeling considerations implicit in this 
choice of boundary conditions can be found in [2] and [6].) We assume that the size dependent 
population distribution at time t = 0 is given by u 0 (x), x Q < x < x^ and set 

(4.5) u(0,x) = u Q (x), x q <x<xi. 

In order to apply the model (4.2) - (4.5) effectively, one must be able to determine the 
moments q t and q 2 , the mortality rate qg and the fecundity kernel q 3 with some degree of accuracy, 
and consistently with biological principles and experimental observation. In problems of practical 
interest to population ecologists, one can expect only limited success in determining these 
parameters directly from knowledge of the growth process. An alternative is to formulate an 
inverse problem wherein the parameters q 0 , q l5 q 2 and q 3 would be determined from observations 
z(tj,x), x Q < x < Xj of the population density at times tj . 
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Let = L eo ([0,T] x [x 0 ,xj]) x LJ[0,T] x [x 0 , Xl ]) x LJO/T; W 1 -^)) x LJ[0,T] x 
[xq.Xj]) and let Q be a compact subset of ^ with the property that q = (q 0 , qj, q 2 , q 3 ) e Q if and 
only if q 2 (t,x) > v > 0 for almost every t £ [0, T] and every x e [x 0 , xj. Let H = L 2 (xq,x 1 ) with 
<•,•> denoting the usual L 2 inner product on H and set V = H^xq.Xj). For each q = (qQ, q j, 

92* Q 3 ) e Q an d almost every t £ [0, T] define the operator A(t;q) : V -» V* by 

(4.6) <A(t; q)(p ,\j/> = - <q^t ; -)<P - D(q 2 (t; -)cp), D\j/> 

x i 

-V(x 0 )J q 3 (t,x)cp(x)dx + <q 3 (t ; -)(p»¥> 
x o 

for cp, \\f £ V. Relatively straight forward arguments can be used to show that conditions (A) - (D) 
are satisfied with p = 2 (see [1]). If we define the least squares performance index 

x i 

m 1 

J(q) = d>(u(- ;q); z) = J (u(t,x;q) - z(t,x)) 2 dx 

i=1 x o 

where for each q E Q u(- ;q) £ C(0,T;H) is the unique solution to the initial value problem (2.1), 
(2.2) with A(t;q) given by (4.6) and f = 0, and z = (z(t p •),..Mz(t m , •)) e Z = x L 2 (x 0 ,x 1 ), 

then, with an appropriate choice of the Galerkin subspaces H n (see [1] and below), our theory 
applies. 

4.2 Biological Mixing in Sea Sediment Cores 

The modeling of biological mixing in lake and sea sediment cores yields a second example of 
a temporally inhomogeneous linear distributed system with an associated inverse problem. Lake 
and deep sea sediment core samples play an important role in geophysical research by providing a 
record of the geological, oceanographic, climatic, and biological history of the earth. Unfortunately 
however, the stratigraphic records contained in these core samples are frequently corrupted by the 
mixing activities of benthic organisms near the sea-floor- sea interface. Through the use of tracers 
from dated events (e.g. radioactive fallout, volcanic eruptions, etc.) data can be obtained from 
which the identification of an appropriate model for the mixing process is possible. Note that with 
an appropriate model, the deconvolution of the corrupted signal becomes a possibility. 

One approach to modeling the mixing process is based upon the assumption that mixing 
occurs only to a fixed depth t from the seafloor/sea interface. Under this assumption, the mixed 
layer in the sediment is modeled as a one dimensional moving diffusion chamber. The linear partial 
differential equation 

(4/7) ^ 7 - (t,x) - {q t (x) u(t,x)) + q 2 (t) u(t,x) = 0, 0<x<£, t>0 

results where u(t,x) denotes the concentration of tracer at time t and depth x in the mixed layer, q j 
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is a depth dependent diffusion coefficient (it is hypothesized that mixing is most intense at the 
seafloor/sea interface and then decreases with depth), and q 2 is a temporally varying sedimentation 
rate. If we assume that the concentration of tracer in the mixed layer at time t = 0 is given by Uq(x), 
0 < x < t, and that no other sources of tracer exist, we are led to the boundary and initial 
conditions given by 


(4.8) 


- QiOO (tix) + q 2 (t)u(t,x)J 0j t > o 


(4.9) 

(4.10) 


{- ■ 

x = ^. 

u(0,x) = Uq(x), 


= 0, t>0 

0<x< l. 


The boundary condition (4.8) states that there is no tracer flux through the top of the mixed layer 
while the boundary condition (4.9) implies that the tracer at depth i comes to rest in the stationary 
sediment layers below the mixed layer. (That is, as more sediment is deposited, since mixing is 
assumed to occur only to a fixed depth, the diffusion chamber moves up, leaving the tracer at the 
bottom behind.) 

The data for an associated inverse problem will come from analysis of the core sample. Our 
observations, z(t), t > 0, therefore, are the concentration of tracer at a depth x = Z in the mixed 
layer over some fixed time period, [0, T]. For brevity, we assume that u 0 and i are known, or 
can be determined either experimentally or from the data, and consider the inverse problem of 
identifying the functional parameters qj and q 2 . (Other inverse problems associated with the model 
given by (4.7) - (4.10), and a more detailed discussion of the modeling process itself can be found 
in [8].) 

Let q, =L oo (0,e) x L oo (0,T ) and let Q be a compact subset of 9> with the property that 
q = (qj, q 2 ) £ Q if and only if q^x) > v > 0 for almost every x e [0,£], We take H = (0,£) 

endowed with the standard inner product, <•,•>, set V = H 1 (0,£), and for q = (q 1? q 2 ) £ Q 

jjg 

and almost every t e [0,T] we define A(t; q) : V -» V by 

(4.11) <A(t;q)cp,Y> = <q 1 D(p,Dvp - q 2 (t)«p,D\[/> +q 2 (t)cp(£)\}/(£). 


for (p,\yeV. 

Using arguments similar to those given in [8], it is not difficult to show that conditions (A) - 
(D) are satisfied with p = 2. The appropriate performance index is given by 

T 

(4.12) J(q) = 0(u(- ;q);z) = J (u(t,£;q) - z(t)) 2 dt 

o 

where u (-,q) e L 2 (0, T;V) is the unique solution to the initial value problem (2.1), (2.2) with 
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A(t;q) given by (4.1 1) and f = 0, and z e Z = 1^(0, T). Note that 0(- ;z) given by (4.12) is 
continuous on 1^(0, T;V). Once again, with an appropriate choice of the H n , our theory applies. 


4.3 Nonlinear Heat Conduction 

A well known model for nonlinear heat conduction or mass transfer provides still another 
opportunity for the application of our framework. The example we present now can be considered 
as a special case of the general class of nonlinear systems which were discussed earlier in this 
section. However, for simplicity in notation we define directly the relevent operators and spaces 
in this particular example. Let £2 be a bounded region in with smooth boundary, let 

= L oo ((0,T ) x £2 x R'*') and let Q be a compact subset of with the property that q e Q if 
and only if 

(1) The mapping £ — > q(t,x,Q is C* for almost every (t,x) e [0,T] x £2, and 

(2) There exists a constant X > 0 which does not depend upon q e Q for which 

(4.13) 0. V ? q(t,x,0 l c=0 -(C - Tl) + q(t,x,0) (Cj - Tl.) > X(C . - Tip , i = 1,2,..., t 

for almost every (t,x) e [0, T] x £2 and all 0,£, rj e R^. (Note that when t=l, the function q(t,x, 
0 = q( 0 = 1 • .5e* ^ satisfies (4.13).) Then for q e Q, consider the nonautonomous, nonlinear 
model for heat conduction or mass transfer given by (see [17], [18]) 

(4.14) (t,x) - V • {q(t,x,Vu(t,x))Vu(t,x)} =0, xe£2, t>0 

ot 

together with appropriate boundary conditions. 

We take H = L^(£2) and take V to be an appropriately chosen closed subspace of H ! (£2) 
which contains H ^(£2). The precise choice of V of course depends upon the boundary conditions 
which accompany (4.14). For example with Diriehlet boundary conditions, we take V = H*(£2) 
and when Neumann boundary conditions have been specified we take V = H } (£2). For each 
q 8 Q and almost every t e [0, T], we define the operator A(t;q) : V — > V by 

<A(t; q)cp,\|/> = J* q(t,x,Vcp(x))V(p(x)V\j/(x)dx 
n 

for (p,\|/ e V. Once again, the hemicontinuity of the operator A(t ;q) and that conditions (A) - (D) 
are satisfied are not difficult to argue. 

Finally, we point out that it is usually not difficult to choose Galerkin subspaces H n and 
corresponding orthogonal projections P n which satisfy condition (E). It is frequently the case that 
choosing the H n as the span of appropriately modified Hermite or polynomial spline functions 
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(calling upon tensor products in higher dimensions) will suffice. For example, in one dimension, 
we suppose that H = 1^(0, 1) and V = H^O.l) endowed with the usual inner products and norms. 
For each n = 1,2,.. let { <p P ) ]L q denote the usual linear spline (or "hat") functions (see [16]) 
defined with respect to the uniform mesh {0, 1/n, 2/n,...,l } on the interval [0,1]. Set H n = span 
{ (p P } and let P n denote the orthogonal projection of H onto the n + 1 dimensional subspace H n 

with respect to the L2 inner product. For I n : V — > H n the interpolation operator defined by 
(I n cp)(j/n) = cp(j/n), j = 0,l,2,...,n, for (p eV, one can obtain (see [19]) the estimates 


ID*(I n 9 — cp)l < k..n i ' j ID j (pl, cpe^O.l), 

i = 0,...,j, j = 1,2, where the kjj are positive constants which do not depend on n or cp. It then 
follows that 

IP n cp - cp I < lljjCp-tpl^kjn'^ltpll, 
for (p eV, and by using the Schmidt inequality (see [16]) that 

(4.15) IIP n (p - cpll 2 = I P n cp - cpl 2 + I D(P n cp - tp)| 2 

< k 2 1 n 2 ll9ll 2 + kn 2 IP n 9-I n cp| 2 + 2|D(I n 9-9)| 2 

< v 2 II9II 2 

where v is a positive constant independent of n and 9 eV. Similar arguments can be used to show 

2 

that lim IIP n 9 - cpll = 0, for 9 e H (0, 1). This, together with density and (4.15), establishes 

condition (E). These ideas extend to higher order splines, other sets of boundary conditions (i.e. 
other choices for V), and higher dimensions via tensor products. 

Schemes based on Galerkin spline approximations as outlined in the previous paragraph can 
be used in a number of important areas of application. In [1], [2], [8] the reader will find 
computational results involving linear nonautonomous systems for which the theory in this paper 
provides a sound theoretical foundation. In addition, we are currently carrying out computations 
for several nonlinear examples including heat conduction models such as the one outlined in section 
4.3. These findings will be reported in a subsequent paper on numerical and implementation 
aspects of the ideas. 
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